Large Vocabulary Arabic Online Handwriting Recognition System

Authors

  • Ibrahim Abdelaziz
  • Sherif Abdou
  • Hassanin Al-Barhamtoshy
Abstract

Online handwriting recognition of Arabic script is a difficult problem because the script is naturally both cursive and unconstrained. The analysis is further complicated by the obligatory dots and strokes that are placed above or below most letters and are usually written later, out of order (delayed strokes). In addition, Arabic is rich in morphology and syntax, so a good online handwriting system must handle a large-vocabulary lexicon. This paper introduces a Hidden Markov Model (HMM) based system that addresses most of the difficulties inherent in recognizing Arabic script. A new preprocessing technique is introduced that handles the delayed strokes so that they match the structure of the HMM. The system uses context-dependent tri-grapheme models to represent the differences between writing units in more detail. The HMMs are trained with Writer Adaptive Training (WAT) to minimize the variance between writers in the training data. Their discriminative power is enhanced with a discriminative training technique, Minimum Grapheme Error (MGE) training. The Gaussian mixtures are split gradually to better represent the feature space. The results are further improved by a post-processing step that rescores multiple hypotheses from the system output with a higher-order language model and cross-word HMMs. System performance was evaluated on two databases covering small and large lexicons. The proposed system shows promising performance compared with state-of-the-art systems.

Author affiliations

  • Ibrahim Hosny: Faculty of Computers and Information, Cairo University
  • Sherif Abdou: Faculty of Computers and Information, Cairo University
  • Hassanin Al-Barhamtoshy: Faculty of Computing and Information Technology, King Abdulaziz University
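
The context-dependent tri-grapheme modelling mentioned in the abstract can be illustrated with a minimal sketch. It only shows how a grapheme sequence might be expanded into context-dependent unit labels; it is not the authors' implementation. The function name `to_trigraphemes`, the `left-center+right` label notation (borrowed from the triphone convention common in speech recognition toolkits), the boundary symbol `sil`, and the grapheme names are all assumptions made for illustration.

```python
from typing import List


def to_trigraphemes(graphemes: List[str], boundary: str = "sil") -> List[str]:
    """Expand a grapheme sequence into context-dependent labels.

    Each grapheme is labelled together with its left and right neighbours,
    written as "left-center+right". The boundary symbol (an assumption of
    this sketch) pads the start and end of the word.
    """
    padded = [boundary] + list(graphemes) + [boundary]
    return [
        f"{padded[i - 1]}-{padded[i]}+{padded[i + 1]}"
        for i in range(1, len(padded) - 1)
    ]


# Hypothetical three-grapheme word; the grapheme names are placeholders.
print(to_trigraphemes(["k", "t", "b"]))
# → ['sil-k+t', 'k-t+b', 't-b+sil']
```

Each resulting label would correspond to its own HMM, so the same grapheme is modelled differently depending on its neighbours, which is the motivation the abstract gives for using such models.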

Similar resources

AltecOnDB: A Large-Vocabulary Arabic Online Handwriting Recognition Database

Arabic is a Semitic language characterized by a complex and rich morphology. The exceptional degree of ambiguity in the writing system, the rich morphology, and the highly complex word formation process of roots and patterns all contribute to making computational approaches to Arabic very challenging. As a result, a practical handwriting recognition system should support large vocabulary to pro...

A Tool to Develop Arabic Handwriting Recognition System Using Genetic Approach

Problem statement: Significant progress has been made in handwriting recognition technology over the last few years. Up until now, Arabic handwriting recognition systems have been limited to small and medium vocabulary applications, since most of them rely on a database during the recognition process. The ability to deal with a large database, however, opens up many more applications. A...

Off-line Arabic Handwritten Recognition Using a Novel Hybrid HMM-DNN Model

To facilitate the entry of data into the computer and its digitization, automatic recognition of printed texts and manuscripts is a considerable aid to many applications. Research on automatic document recognition started decades ago with the recognition of isolated digits and letters, and today, due to advancements in machine learning methods, efforts are being made to iden...

On-Line Unconstrained Handwriting Recognition Based on Probabilistic Techniques

This paper discusses a probabilistic on-line handwriting recognition scheme, based on Hidden Markov Models (HMMs), and its implementation for recognizing handwritten words captured from a tablet. Statistical methods such as HMMs have been used successfully for speech recognition. These methods have recently been applied to the problem of handwriting recognition as well. This paper discusses...

Improved On-Line Handwriting Recognition Using Context Dependent Hidden Markov Models

This paper presents the introduction of context dependent Hidden Markov Models for cursive, unconstrained handwriting recognition with large vocabularies. Since context dependent models were successfully introduced to speech recognition ([1], [2], [3]), it seems obvious that the use of trigraphs could also lead to improved on-line handwriting recognition systems [4]. In analogy to triphones in s...

Journal title:
  • CoRR

Volume abs/1410.4688

Publication date 2013